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Abstract 

We present the spectrum of the (normalized) graph Laplacian as a 
systematic tool for the investigation of networks, and we describe basic 
properties of eigenvalues and eigenfunctions. Processes of graph formation 
like motif joining or duplication leave characteristic traces in the spectrum. 
This can suggest hypotheses about the evolution of a graph representing 
biological data. To this data, we analyze several biological networks in 
terms of rough qualitative data of their spectra. 



1 Introduction 

Modern biological data are often represented in terms of graphs. Microarray 
data may lead to graphs whose vertices are genes and whose edges stand for 
correlations, hypothetically interpreted as interactions. In the study of the pro- 
teome of a cell, one sees protein-protein interaction networks. Likewise, at a 
higher level, cell-cell interactions naturally lead to interaction graphs. A partic- 
ular example are neural networks where the vertices stand for neurons and the 
edges for synaptic connections. In populations, graphs encode networks of inter- 
actions between individuals, and in ecosystems, trophic and other interactions 
between species. A special case are phylogenetic trees that express descendence 
relations between species. 

The natural question then is how biological content can be extracted from these 
formal structures, the graphs to which the biological data are reduced. In graph 
theory, many concepts have been developed that capture various quantitative 
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or qualitative aspects of a graph (for an algebraic, graph theoretical approach, 
see e.g. [T31I5]. for statistical mechanics methods see e.g. [H HI] ) • Recently, 
a power law behavior of the degrees has become quite popular as it seems to be 
rather ubiquitous in biological and other data ([7]). Another powerful invariant 
of the graph is its first eigenvalue that provides estimates for how difficult it is to 
cut up the graph into disjoint components (see [TU], or for how easily dynamics 
at the vertices can get synchronized ([26] HJJ [2UJ, Uj) and many other articles). 
Useful as any such individual invariant may be, however, it cannot capture all 
the qualitative aspects of a graph. For example, graphs with the same degree 
distribution can have a completely different synchronizability ([2J [3]). Also, 
by their very nature, universal properties like a power law degree distribution 
capture what is common to large classes of graphs, but fail to identify what is 
specific about graphs from a particular domain, and what distinguishes those 
graphs qualitatively from those from other fields. 

Therefore, in this contribution, we advocate a set of graph invariants that, on 
one hand, can be easily graphically represented and therefore visually analysed 
and compared, and on the other hand, yields an essentially complete qualita- 
tive characterization of a graph. This is the spectrum of the graph Laplacian 
([2IJ|23j[18l[19l|33l|8l|6]). In [5], we have applied this method to the study of 
protein-protein interaction networks. 



2 The spectrum of a graph 

Let r be a finite and connected graph with N vertices. Vertices i,j € T that are 
connected by an edge of T are called neighbors, i ~ j. The number of neighbors 
of a vertex i £ T is called its degree rii . For functions v from the vertices of T 
to ffi, we define the (normalized) Laplacian as 

Av(i) := v(i) — ^ v(j). (1) 

3,3 ~ '» 

(Note that this operator is different from, and in particular, has a different spec- 
trum than the operator Lv(i) :— riiv(i)—J2j j^i v (j) usually studied in the graph 
theoretical literature as the (algebraic) graph Laplacian, see e.g. [£fl rHll2"Tll2"51 l8]. 
but has the same spectrum as the Laplacian investigated in [10]. The normalized 
Laplacian is the operator underlying random walks on graphs, and it naturally 
incorporates a conservation law.) 

We are interested in the spectrum of this operator as yielding important 
invariants of the underlying graph T and incorporating its qualitative properties. 
As in the case of the algebraic Laplacian, one can essentially recover the graph 
from its spectrum, up to isospectral graphs. The latter are known to exist, but 
are - arguabljj^]- relatively rare and qualitatively quite similar in most respects 

1 For example, most trees are not uniquely determined by their spectrum. 
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(see [33J for a survey). For a heuristic algorithm for recovering a graph from 
the spectrum of its algebraic Laplacian which can be easily modified for the 
normalized Laplacian, see |15j . 

We now recall some elementary properties, see e.g. [TUl HO]- The normalized 
Laplacian, henceforth simply called the Laplacian, is symmetric for the product 

(u, v) := nju(i)v(i) (2) 
iev 

for real valued functions u, v on the vertices of F. A is nonnegative in the sense 
that (Ait, u) > for all u. 

The eigenvalues of A therefore are real and nonnegative, the eigenvalue equation 
being 

Au-\u = 0. (3) 

A nonzero solution u is called an cigcnfunction for the eigenvalue A. Since L has 
N vertices, the function space on which A operates is TV-dimensional. There- 
fore, it has N eigenvalues; some of them might occur with multiplicity > 1. The 
eigenfunctions for an eigenvalue A constitute a vector space whose dimension 
equals the multiplicity of the eigenvalue A. In the sequel, when we describe an 
cigcnfunction, this is to be taken as some suitable element of this vector space. 
The smallest eigenvalue is Ao = 0, with a constant eigenfunction. This eigen- 
value is simple because we assume that F is connected; in general, the multi- 
plicity of the eigenvalue equals the number of connected components, with the 
corresponding eigenfunctions being = 1 on one and = on all other components. 
Returning to our case of a connected graph T, then 

A fe > (4) 

for k > where we order the eigenvalues as 

A = < Ai < ... < Ajv-i- 

After the brief discussion of the smallest eigenvalue, 0, we now turn to the 
largest one; here, we have 

Xn-i < 2, (5) 

with equality iff the graph is bipartite. Thus, a single eigenvalue determines the 
global property of bipartiteness. In fact, it is also true that a graph is bipartite 
iff whenever A is an eigenvalue, then so is 2 — A. In other words, a bipartite graph 
has a spectrum that is symmetric about 1, and this characterizes bipartiteness. 
It is also instructive to look at the spectrum of particular graphs. For example, 
for a complete graph of N vertices, we have 

iV 

Ai = - = Ajv-i = N _ 1 > ( 6 ) 

that is, the eigenvalue -^-j- occurs with multiplicity N — 1. Among all graphs 
with N vertices, this is the largest possible value for Ai and the smallest pos- 
sible value for Aat_i. Again, this spectral property fully characterizes complete 
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graphs. 

The preceding examples concern exact values for the eigenvalues. In contrast, 
qualitative properties of a graph are usually characterized by inequalities for its 
eigenvalues, an issue that we shall return to below. 

3 Eigenfunctions 

When we think of a graph T representing biological data as a structure that has 
evolved from some simpler precursors, for example by joining smaller graphs 
into a larger one, or by duplicating certain sets of vertices in a precursor graph, 
it is important to find some indications of this process in the spectrum of T. It 
turns out that also certain properties of eigenfunctions can be useful here. We 
shall now describe some such aspects in formal terms (for some details, we refer 
to [6]). 

In some cases, a solution Uk of the eigenvalue equation 

Au k - AfcUfc = (7) 

can be localized, that is, be outside a small set of vertices. In other cases, it has 
to be global, that is, be only at relatively few vertices. These are qualitative 
notions, but they provide some insight into the behavior of graphs under certain 
operations as we shall now explore. 

The considerations will depend on the eigenvalue equation (|3|, rewritten as 

— ^ u(j) = (1 - \)u(i) for all i. (8) 

We observe that when the cigcnfunction u vanishes at i, then also 

£<?) = o. ( 9 ) 

The converse also holds, except for the case A = 1 when ^ holds at all points 
regardless of whether the eigenfunction vanishes there or not. 
We start with constructions that lead to localized eigenfunctions. We think of 
a motif as a small graph whereas the graph T is supposed to be large. This is 
not at all necessary for the subsequent constructions, but is in the spirit of the 
term "localized". 

1. Motif joining: Let To be another graph, jo a vertex of Tq, with an 
eigenvalue A and an eigenfunction u x that vanishes at jo, i.e., u x (jo) = 0. 
When we then form a graph T by identifying the vertex jo with an arbi- 
trary vertex i of T, the new graph To also has the eigenvalue A, with an 
cigcnfunction that agrees with u x on r and vanishes at the other vertices, 
that is, those coming from T. Thus, a motif To can be joined to an exist- 
ing graph with a preserved eigenvalue and a localized cigcnfunction when 
the joining occurs at one (or several) vertices where that eigenfunction 
vanishes. 
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2. Motif duplication: Let Li be a motif in L, that is, a (small) subgraph 
of r, with vertices ji, ■ ■ ■ ,j m - Let the function u on the vertex set of Tq 
satisfy 

— u(j) = (1 — X)u(i) for all i G Li and some A. (10) 

Let f be obtained from L by doubling the motif Li, that is, by adding 
vertices i\, . . . , i m and their connections as in Li and connecting each i a 
with all i ^ Li that are neighbors of j a . Then the graph T possesses 
the eigenvalue A with an eigenfunction u x that is localized on Li and its 
double; it agrees with u on T\, with — u on the double of L 1; and vanishes 
on the rest of L. Thus, the eigenvalue A is produced from motif duplication 
with symmetric eigenfunction balancing. 

Not all eigenvalues possess localized eigenfunctions. Take cyclic graphs 
Li,L2 of lengths Am — 1 and An + 1, for some positive integers m,n. Since 
the only cyclic graphs that admit the eigenvalue 1 are those of length Ak, nei- 
ther Li nor L 2 possesses the eigenvalue 1, but if we join them by identifying a 
vertex i a G Ti with a vertex j G L 2 the resulting graph L has 1 as an eigen- 
value. An eigenfunction has the value 1 at the joined vertex, and the values 
±1 occurring always in neighboring pairs at the other vertices of Li,L2, where 
the two neighbors of io = jo in Li both get the value —1, and the ones in L 2 
the value 1. Since the multiplicity of the eigenvalue 1 on L is 1, there exists no 
other linearly independent eigenfunction for the eigenvalue 1. Thus, the local 
construction of joining two graphs at a single vertex here produces an eigen- 
function that cannot be localized. 

As another example, we can take any two graphs Li,L2. Their disjoint union 
then has two components, and therefore, the multiplicity of the eigenvalue is 
2. One eigenfunction uq is = 1 on Ti and = on L 2 , and for the other one, 
vq, the roles of the components are reversed. When we now form a graph L 
by connecting some vertex io G Li to some vertex jo G L 2 by an edge, then 
the multiplicity of the eigenvalue Ao — becomes 1 because L is connected; the 
corresponding eigenfunction u is = 1. However, when both T\ and T 2 are large, 
the next|^] eigenvalue Ai of L is very small, and a corresponding eigenfunction 
is well approximated by one, v, that equals a positive constant on Ti and a 
negative constant on L 2 (satisfying X)ier niW (*) = 0)- Thus, it is a symmetric 
linear combination, v an antisymmetric one of the original eigenfunctions uq, vq, 
and also the eigenvalues are close. 

4 Properties of spectral plots and evolution hy- 
potheses 

Constructions like motif joining or duplication describe certain processes of 
graph formation that leave characteristic traces in the spectrum. This sug- 

2 assuming for simplicity that ri,r2 do not have small nontrivial eigenvalues themselves 
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gests that they can also serve useful roles for developing hypotheses about the 
evolution of a graph representing actual biological data. Of course, such hy- 
potheses then need to be biologically plausible as well. Let us consider some 
examples. The simplest version of motif duplication is the doubling of a single 
vertex j\ £ T. According to the general scheme, we add a new vertex i and 
connect iq with all neighbors of jo- This generates an eigenvalue 1, with an 
eigenfunction u\ that is localized at jo and io, wi(jo) = li u i(*o) = — 1- Thus, 
if the spectral plot of a graph has a high peak at the eigenvalue 1, a natural 
hypothesis is that this graph evolved via a sequence of vertex doubling. 
The next simplest case of a motif is an edge connecting two vertices ji,j 2 . ( 10 1 
then becomes 

J-u(j 2 ) = (l-X)u(j 1 ), — u(7i) = (l-A)«(i 2 ), (11) 

n jl 12 32 

with the solutions 

A± = l±— (12) 

Thus, the duplication of an edge produces the eigenvalues X±. These are sym- 
metric about 1. Also, when the degree of ji or j 2 is large, A± are close to 1. 
Thus, when the spectral peak at 1 is high, but not too sharp, and symmetric 
about 1, this is an indication that edge duplication has played some role in the 
evolution of the structure. 

Next, we connect an edge between vertices ji,j 2 to an existing graph T by con- 
necting both ji and j 2 via an edge to some vertex io £ T, or equivalcntly, we 
join a triangle with vertices j , j\, j 2 to T by identifying j with i e T. In that 
case, we produce the eigenvalue 3/2. An eigenfunction u for the eigenvalue 3/2 
satisfies u(ji) — l,u(j 2 ) — — 1, and vanishes elsewhere. Thus, again, it is local- 
ized. The same result obtains when we join the triangle by connecting jo and 
io by an edge instead of identifying them. A high multiplicity of the eigenvalue 
3/2 may then generate the hypothesis that such processes of triangle joining 
repeatedly occurred in the evolution of the structure. 

When in addition to the triangle jo,ji,j2 another triangle ko,ki,k 2 is joined 
by identifying both jo and fco with io € I\ we not only generate the eigen- 
value 3/2 with multiplicity 2, but also the eigenvalue 1/2, with an eigenfunction 
v(ji) — v(j 2 ) — l,v(ki) — v{k 2 ) — —1 and elsewhere. Again, such a feature 
when prominently observed in a spectral plot may induce a corresponding hy- 
pothesis. 

The described operations can also be of a global nature. For example, we can 
double the entire graph T; when T consists of the vertices pi,.. . ,Pn, we take 
another copy V with vertices qy , . . . , and the same connection pattern and 
connect each q a also to all neighbors of p a . The new graph T then has the 
same eigenvalues as T, plus the eigenvalue 1 with multiplicity N. This is bio- 
logically relevant, because there is some evidence for whole genome duplication 
|251 1291 132] . However, protein- protein interaction networks do have a high mul- 
tiplicity, but not of the order of half the system size [5 j . This is readily explained 
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by subsequent mutations after the genome duplication that destroy the symme- 
try and thereby reduce the multiplicity of the eigenvalue 1. Also, since graph 
duplication does not change Ai and A^r-i, the synchronization properties are 
not affected (see [20]). 



5 Examples of spectral plots 

We now exhibit spectral plots of different formal and biological networks. We 
convolve the eigenvalues with a Lorentz kernel, that is, we plot the graph of the 
function 

/O) = V tt ry-j 2 

(A,- - xY + Y 

where the A,, are the eigenvalues and we choose the parameter value 7 = -03(^] 
In Figure [l] we see an Erdos-Renyi random network, a Strogatz- Watts small- 
world network, and a Barabasi-Albert scale-free network. Each of these types 
has its very distinct shape, and this is not affected by varying the parameters 
underlying the construction schemes. In Figure [2] we then see a protein-protein 
interaction network and two neurobiological networks, and in Figure [3] we have 
a metabolic network, a food-web, and a transcription network. These are just 
examples, and choosing other examples from the same category yields very sim- 
ilar shapes. As we directly see, however, shapes of spectral plots for networks 
from different biological realms are very different from each other and from the 
formal networks, even though those have been suggested to capture important 
aspects of biological networks. Clearly, this indicates that for analysing biolog- 
ical networks, it does not suffice to rely on some generic formal construction 
scheme. Rather, one needs to analyse the specific aspects of specific biological 
realms through formal methods that are sufficiently rich to capture the essential 
qualitative features of that biological domain. In this paper, we have proposed 
spectral analysis as such a method. 
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(c) 

Figure 1: Specral plots of generic networks, (a) Random network by Erdos 
and Renyi's model [13] with p = 0.05. (b) Small-world network by Watts and 
Stogatz model [3D] (rewiring a regular ring lattice of average degree 4 with 
rewiring probability 0.3). (c) Scale-free network by Albert and Barabasi model 
[7] (mo = 5 and m = 3). Size of all networks is 1000. All figures are ploted with 
100 realization. 



10 




0.5 1 1.5 2 2.5 3 3.5 4 0.5 1 1.5 2 2.5 3 3.5 4 



(a) (b) 



0.015 




(c) 

Figure 2: Spectral plots of (a) protein-protein interaction network of 
Saccharomyces cerevisiae (yeast). Network size is 1458. Data down- 
loaded from http://www.nd.edu/~networks/ and data used in |17j [down- 
load date: 17th September, 2004] (b) neuronal connectivity of C. ele- 
gans. Size of the network is 297. Data used in [301 131] • Data Source: 
http://cdg.columbia.edu/cdg/datasets/ [Download date: 18th Dec. 2006]. (c) 
neuronal connectivity of C. elegans from the animal JSH, L4 male in the nerve 
ring and RVG regions. Network size is 190. Data source: Data is assembled 
by J. G. White, E. Southgate, J. N. Thomson, S. Brenner [31] and was later 
revisited by R. M. Durbin (Ref. http://elegans.swmed.edu/parts/ ). [Download 
date: 27th Sep. 2005]. 
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Figure 3: Spectral plots of (a) metabolic network of A. pernix. Network size is 
490. Here nodes are substrates, enzymes and intermediate complexes. Data 
used in [16]. Data Source: |http: / /www. nd.edu/ ~networks/resources.htm/ 
[Download date: 22nd Nov. 2004]. (b) food- web from "Ythan estuary" 
Size of the network is 135. Data downloaded from |http://www.cosin.org/| 
[Download Date 21st December, 2006]. (c) transcription network of E. coli. 
Size of the network is 328. Data source: Data published by Uri Alon 



( http : / / www . weizmann . ac . il/mcb /Uri Alon / 
2004]. Data used in 



). [Download date: 13th Oct. 
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